False discovery rates: a new deal

نویسنده

  • Matthew Stephens
چکیده

We introduce a new Empirical Bayes approach for large-scale hypothesis testing, including estimating false discovery rates (FDRs), and effect sizes. This approach has two key differences from existing approaches to FDR analysis. First, it assumes that the distribution of the actual (unobserved) effects is unimodal, with a mode at 0. This "unimodal assumption" (UA), although natural in many contexts, is not usually incorporated into standard FDR analysis, and we demonstrate how incorporating it brings many benefits. Specifically, the UA facilitates efficient and robust computation-estimating the unimodal distribution involves solving a simple convex optimization problem-and enables more accurate inferences provided that it holds. Second, the method takes as its input two numbers for each test (an effect size estimate and corresponding standard error), rather than the one number usually used ($p$ value or $z$ score). When available, using two numbers instead of one helps account for variation in measurement precision across tests. It also facilitates estimation of effects, and unlike standard FDR methods, our approach provides interval estimates (credible regions) for each effect in addition to measures of significance. To provide a bridge between interval estimates and significance measures, we introduce the term "local false sign rate" to refer to the probability of getting the sign of an effect wrong and argue that it is a superior measure of significance than the local FDR because it is both more generally applicable and can be more robustly estimated. Our methods are implemented in an R package ashr available from http://github.com/stephens999/ashr.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data

Background and Objectives: In recent years, new technologies have led to produce a large amount of data and in the field of biology, microarray technology has also dramatically developed. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect the differentially expressed genes. In this study, the false discovery rate was investiga...

متن کامل

Hierarchical False Discovery Rates: Large-scale Inference for Plate-based High-throughput Phenotyping Methods by

Hierarchical False Discovery Rates: Large-scale Inference for Plate-based High-throughput Phenotyping Methods Hannes Bretschneider 2011 This thesis introduces Hierarchical false discovery rates, a new semi-parametric Bayesian method for the detection of causal links between a genotype and phenotype in high-throughput phenotypic studies. Hierarchical false discovery rates are designed for plate-...

متن کامل

Controlling False Alarm/Discovery Rates in Online Internet Traffic Classification

Classifying Internet traffic flows online into applications or broader classes without inspecting the packet payloads or without relying on port numbers has become a necessity for network operators. The operators can use this information to monitor their networks and provide per-class quality of service. There has been a great deal of research done on Internet traffic classification recently an...

متن کامل

An effective method for controlling false discovery and false nondiscovery rates in genome-scale RNAi screens.

In most genome-scale RNA interference (RNAi) screens, the ultimate goal is to select siRNAs with a large inhibition or activation effect. The selection of hits typically requires statistical control of 2 errors: false positives and false negatives. Traditional methods of controlling false positives and false negatives do not take into account the important feature in RNAi screens: many small-in...

متن کامل

False Discovery Rates and the James-Stein Estimator

The new century has brought us a new class of statistics problems, much bigger than their classical counterparts, and often involving thousands of parameters and millions of data points. Happily, it has also brought some powerful new statistical methodologies. The most prominent of these is Benjamini and Hochberg’s False Discovery Rate (FDR) procedure, extensively explored in this issue of Stat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2017